(54) Title: HUMAN GALECTIN-4-LIKE PROTEIN AND cDNA ENCODING THE SAME



(57) Abstract 

A galectin-4 (a lactose-binding protein)-like protein expressed specifically in the stomach and intestines, and a human cDNA encoding
the same. The protein contains the amino acid sequence represented by SEQ ID NO: 1, and the gene is a cDNA containing the
base sequence represented by SEQ ID NO: 2. The protein that is the expression product of this cDNA has lactose-binding activity and is
applicable to drugs and research reagents.






WO 97/03190 PCT/JP96/01899

DESCRIPTION



5 

fflMM 

:3-KT3#V^>-4#g6KCT-f3o *|gHJ©t hcDNAJi. m&Tim 

trisp^i&f?? b-xtmttzmMis97>(DmFZ'$>z 0 warns 

dr'f'M-^LTl ^ t*s*. t>tlTl ^[Drickamer, K. , Annu. Rev. Cell 
Biol.. 9:237-264(1993): o Z(D^\ #V^f->- 4ii^ y h©Jl§©ttt±i^ 

>- 4 £rr— Kf £cDNA£-^ d— Wfc£ftTt# ^SMOda, Y. et al. , J. 
Biol. Chem. ,268:5929-5939(1993)]. t h#V^>- 4}f36M£:3- Kf 5 
cDNACOl vCCDfR^Jittt \> 

^wjjt ^(m&w^©7gm. #v ? ?- > - 4 3 - K-r * t kcdxa 
2 3 -ra^nsig^K^j^^cDNA^sifitr So 
t hcDNAte. t h«fi^cDNA ^ >T 7*^ U t> * n - WtT £ - t
^'tB^o CCcDNA^ ^ «J -(it N*fflfl^&fl&ft} Lrt: # 'J (A) + RNA<£-^i 

fe^eBBflSr^mu ^^JT(il?Sm&^^L7 , -c^U(A) + RNA^^^ 0 c 
5 mkO>&m^fotZiT\$, l^lil-Bergi£[Okayama,H. and Berg, P., Mol.Cell. 

Biol. ,2: 161-170(1982)], Gubler-HoffmanjfeCGubler, U. and Hoffman, J. Gene, 25 
:263-269(1983)]#£l yfrKZJjfe&mi, ^T&cfct ^ Sfefeg^ o- 
■IZ ftZfcibiZi*^ mMBffllZ&ftzJ:'? + >y t°>^WKato, S. et al. , Gene, 
150 :243-250( 1994) j^fflt U > c cDNAO^ii, y >> >^c; 

(2\ B^J#^3TH£ft£t>©i^ 1113bp^^^^>^@e^iJ^WU 972bp©^ 

323T~ ygJB8l£tfr>£tf5SaK£3- KlTl^o C<DSSM(iT~y^ 
Sffim y h if Is ? > - 4 £76. 3% 1 1 ^ i^C «itfe£W LTC >5 0 

NA ^ >T 7 7 "J * U -— > ^'-T 3 C £ f Z J: £ . *3KflCDcDNA t [WJ— © ? n - 
30 A^o 
^«@S?U^^t J -cDNASffit(10bpJgUi)^tn^o t>xifcj:j;7>f-f?
15 BSg>«iWj:^ 

@ 1 ii&mn<D-?^7. 1 KpHP0i049®^it^^r o 

["Molecular Cloning. A Laboratory Manual", Cold Spring Harbor Laboratory, 

19893 triEofco nim$&£&%m&affim*, mam^m^^ ^mm 

25 lZfe^tz 0 cDNA^j&i££jS*[Kato, S. et al. , Gene, 150 = 243-250(1994) Hzft^tz* 






5 Ltz^ 3^[0kayama N H. et al. , "Methods in Enzymology" Vol. 164, Academic 
Press, 1987] izm\ 750#g<DnRNA£roUfc«, Cft£20mM h U >USHfiHKftCpH 
7.6X 0.5M NaCl. ImM EOTATSfcf£ Ltaf* >J JdT4?/l/n-x#^A{r^f:j\ ±J§ 
«i3 ^ »J (A) + RNA 10 #g£#fco 

10 cDNA5^7'^'J-g>ffS} 

jtie^U(A) + RNA 10^g^l00mMh';x^^«^(pH 8)fdgft?U RNase^-g- 

Wmj- b >J >7A(pH 6). lmH EDTA > 0. 1% 2-/)Wf h^? J-JK 0. 01% Triton 
-7^7 y n^-Xtt^Dl^^j^jnUT^ ^M100^1T37°ClBfp H lS^$ii:7to S 

* -t-y -7°^ Lfc^ 'J (A) + RN f A^^#7t 0 
SiMr-r *y ytmistztf U (A) + RNA. DNA-RNA+/: 5^ 'J * K 

20 (5' -dG-dG-dG-dG-dA-dA-dT-dT-dC-dG-dA-G-G-A-3' )3nmol£50mM h» 'J XiMm&ffi 
7&(pH 7.5), 0.5mM ATP, 5mM MgCl 2> lOmM 2-/ )\s-ft-f J — ;k 25%^Uai 
^-l^>^Uzi-;l/7j<:^{C^U T4RNA'J^-4r' 50#^^!JDU ^430^1 
T20°C12Bff B 1S^$li:7to ®M^7x / -yltttrj^ x^y-;OT^ lv 
^b-y h^zkK^U ^^^-U^-^y^^P^UCA)^^/^ 

25 *^#^§^U^^7^-pKAl(EP 426455-A)<£-KpnlT7NYL^ 

&m : £Ltzb(D&<??-y?'i'?-ti,Tm\,^zo 
9cizmmLtz*j7*vn**^yzfmQtfvarm 6#g£, ^59-^4 

"7- 1.2^g£T--;l^*r^, 50mM hUxJ^»^(pH 8.3), 75mM KC1, 
30 3mM MgCl 2 , lOmMv^^Xl/'f h- ;k 1. 25mM dNTP(dATP+dCTP+dGTPidHP)^ 
50mM H ';x^M®^(pfl 7.5). lOOmM NaCK lOmM MgCl 2 . M^^TXlM' h
mM h U XimmmMpK 7. 5\ lOOmM KCK 4mM MgCl 2 . IOdiM (NH 4 ) 2 S0 4 . 50 txg 

u \Q°c\mmsjz^tzo mi? znu ^ma^u^-- tri 

4if£ftu ^SRNaseH 0.1^^»U ^^lB^ol ^2 0 ClB#^S^$-arfc 0 
1 0 ^cDNA^fi^t^ffl I ^T^m®DH12S(GIBC0-BRL^±$t|)o^K^^^ 

£tf£l00 #g/ml T > tV U >"^^2xYT^j#±-tiLh{r^l ^37^— Bfcig# L/c 0 ^ 
?et(C£L;*:ffi§:©ii a^-£-f£t, ^00 #g/mlT > t°~> U >^f2xHig±li22ml{~^ 
S LT37°C2Btr B ^«^s t- vMK13K 07( 7 7 ^ ->T*±)£^£ 

15 -fe^ £^37°C-Bfci£#U^ 0 i&«^^t'^UT. ®f££±7fffC#t;>\ ffifltf^ 

■7t-^DNA^^SU/c 0 2^|ft^X= HDNAiiEcoRI iNotlTZlS^f kU/c^. 

*m.7 r - vDNA(i N ^3tfe^TS U/cM13 JL^/ -f v- £Taq^ 'J 

^ofc^ ^bfeDNAv'-^yif-CT^-r HX-r^-vX^AXmf^^TcDNA 
©5' 3«j400bp©J&M^J£#i>£UfCo l^'J^-^ii^^e • 7°u^ >cDNvW 

25 cDNA^a-— 

□ tM >i=-^^-X^^U/Co $#rV:7 h^7iT(iGENETYX-MAC(V7 

30 5 -y h tru 5 f - >-4 tru mm v^xmm.^ ltc ^ c ttmm Ltz 0 
Z.(D? a-><Dm^m 1 K^To cDNA-T Vif- h <D£:igM8&\\&®& LtztZ
^ 56^05* #H§R^I& 972bp©^---7°> 'J -f>f >^'"7 A, 85bpCD3'# 
881RH^ 37bp0^y(A)-r-;^(a>^a^t^WLTt^(@^iJ#^-3) 0 
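
The segment lengths quoted in this passage (56 bp, 972 bp, 85 bp and a 37 bp poly(A) tail) can be checked against the 1113 bp of SEQ ID NO: 3 and the 323 residues of SEQ ID NO: 1. The following is a minimal sketch of that arithmetic. The labels attached to each number (5' untranslated region, open reading frame including the stop codon, 3' untranslated region) are assumptions read from the surrounding text and the sequence listing, and the function name is hypothetical.

```python
# Segment lengths as quoted in the passage above. The role assigned to each
# number is an assumption, not something the extracted text states legibly.
UTR5_BP = 56      # assumed: 5' untranslated region
ORF_BP = 972      # assumed: open reading frame, stop codon included
UTR3_BP = 85      # assumed: 3' untranslated region
POLY_A_BP = 37    # poly(A) tail; it is not part of the 1113 bp listing


def cdna_layout(utr5: int, orf: int, utr3: int) -> dict:
    """Derive total length, ORF start and residue count from segment lengths."""
    if orf % 3 != 0:
        raise ValueError("an open reading frame must be a multiple of 3 bp")
    codons = orf // 3
    return {
        "total_bp": utr5 + orf + utr3,   # length without the poly(A) tail
        "orf_start": utr5 + 1,           # 1-based position of the ATG
        "codons": codons,                # sense codons plus one stop codon
        "residues": codons - 1,          # the stop codon encodes no residue
    }


layout = cdna_layout(UTR5_BP, ORF_BP, UTR3_BP)
print(layout)
```

Under these assumptions the three segments add up to the length given for SEQ ID NO: 3, the coding region starts at the position given in its CDS feature, and the residue count agrees with SEQ ID NO: 1.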

Table 1



HS MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHMKRFFVNFVVGQDPGSDV
******************* . ******. ****. ****. *...*. ** ***. **** *. *. 
RN MAYVPAPGYQPTYNPTLPYKRPIPGGLSVGMSIYIQGIAKDNMRRFHVNFAVGQDEGADI

HS AFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFELVFIVLAEHYKVVVNGNP
******************. *. *. **. **. *. ****. ** *****. *. . *********. * 

RN AFHFNPRFDGWDKVVFNTMQSGQWGKEEKKKSMPFQK^^ 
HS FYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQ--GPPMMPPYPGPGHCHQQLNS
**********************. ******. **** . * *. .*.**..*. . . *. ** 

20 RN FYEYGlfl&PLQMVTHUJVDGDmjQSI^ 

HS LPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKGYVPPTGKSFAINFKVGSSGDIALHIN
**. *. ***. ******* * ***************** **. *. . *******. ****. *, * 

RN LPVMAGPPIFNPPVPYVGTLQGGLTARRTIIIKGYVLPTAKNLIINFKVGSTGDIAFHMN
HS PRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFDLSIRCGLDRFKVYANGQHLFDFA
25 **. *. *****.********.**..****.*****************.*********. 

RN PRIGD-CVVRNSYMNGSWGSEERKIPYNPFGAGQFFDLSIRCGTDRFKVFANGQHLFDFS
HS HRLSAFQRVDTLEIQGDVTLSYVQI

RN HRFQAFQRVDMLEIKGDITLSYVQI
tUsm £ft*:cDNA©I^iJ£/B I vriiaSffiSfll-r- * -<-XGenBank™/EMBL/DDBJ
imWmL 1 022# g ^ £1 1 13# g ) d=^W(C— ^-^cDNAcDgp^i^iJCAccession 

i o ^mjKDcMk&m-tz^? ^-phpoio49^i TnT-^^ mmismmm 

<y HrtfStf^n K3-;Kr^oTTO-pfc 0 KpHP01049 2#g£. T N T 

15 tt—y^-£t^^)2nU [ 35 Sj^^^>(T-7-->^Att)8^1(0.37MBq/^l 
. X T7RNA^'J/7— V2nU RNasin 80U£^1&M100^©^^T30 o CT90 
^SJ^^-tirTto ®T>^3 ijl 1 (CSDS-tf- 'J >^'> <7 7 7 -( 125mM h U Xig^gg 
SBflfc. pH6.8, 120mM 2-y;l<7 , 77 e h^y-;k 2% SDS^ N 0. 025%ya^r7 ^ 
y-;ur;U- N 20%^'U-tra-;U)2^1<£-jDx.> 95 0 C3^rB%QMSL/cm> SDS- 

20 ^ ij t ^ U JIT i V¥)\mmmmzti^Wz 0 ;t- h 5 ^Jr?^ 1 4 -#fTttl \ 

S^FM35940 i-^C U C ®cDNA^«j^Cl^iJ#^ 3 Tg£ft35fiK£3- 

25 

-t?7 r a-X4By;l^«( 7 Wl^>T*±)100ml£0. b 'J 

#S8£U7t^ 100mi©0.5Mp»-b 'J r>A(rH®U^o cn{cti^x;u^> 
10ml£»]U ^TlMt^tr^L/Co f ;U£0. 5M^^- h <J -^Atft 
30 10% 7 ^ b-X. 0. 5M#»- h U ^A^(r^UMST-Bfe^^ 
iZimLtZo fJl&O. SMgSH- h >) ^ 7K 0. 05M l ) >$S»$t(pH 7. 0)Tffi

h U *A£3fcO. 05M U >$Si«&(pH 7. 0)+. 4°CT{S#L7t 0 
<< y t" h n#HR£jlw&100 ;u l£-fc"7 7T>y * XG-75(C^i *Tx 35 S] 

h-X#5Affl#^A^M20mMMjXi£MW$> pH 7.5> 2raM EDTA. 150mM 
NaCK4mM 2-y;k*77 D h^y-;k 0.01% Triton X-100)20nlT#fc?f^ 0. 3M5 



■f^Xi KpHP01049 1 /Ltg&, 20W©NotIT^tUcm. tyj^timiz^ 

m^frlf, 1. 2kbp©DN T A0ftit£-yVl^ ^ ^ ft U/Co 

^TpMAL TM -c2(-=^--f y K><-r^-5^'Xtt)l #g£. 20i|ite©HindII 1 

mmtLfz^ t^jommz&^mjmdv&'nti^tz* o^-epsti-c^bu 

fc^. \%TJi'u-7.V)\mMil*mz.Mi^ 6.7kbp©DNA^jt^Vl^^9ftL 
20 /Co ^^^-®fK-icDNA^fH-^5^^--> 3 >4-«y HZ^mm, ;»®JM109 

zmnmiLtzo mmmm^^xi kpmalgau^i^u mfmmwm& 

pMALGAL4/JM109<7>— Bfcig#?£l0ml£100 ^g/mlT > ti 'J U -y ^i&Cl 

U v bJ^tzO HJ7°b> 10g> SWSm 5g, NaCl 5g. 2g^ 
25 &)500mll3B®U 37*CTJ8fi*8£U A 6OO ^0. 5(r^o/ci^{C-r v~fu t° 

pH 7.4^ 200mM NaCU M EDTA)25ml{lWU/«i 0 C©^£j@^HS#!sSI& iS 
>I>U Jz&^-^-y K#BK. 5ml©T i □-X*^^^— f >^"5 > K> 
30 5^'Xtt)(C^{tfco t>7^i^CDm&(DJ]7^<v7T-Xt>^l*&ffi&yk, 
10 ^ N 0.3M5^ h-X^^^^AWMffllT^ffiU^o *mt}UTt/cg6K£-
SDS-^ 'J ') )VT I YmMfaWrtZM jtztZ 5> 81kDa©teSC^— K 

15 ^atr a #u ^ f - >-4mw&M(D$m, 

7°7Xi KpHP01049 l*/g£\ 20#fe€>NotlT^bUfc^ ? W^MSirj; 
^W^f^mi-p/Co o^T'Aatira&fbL/c^ 0. 8%T#o-X^fVl^m 
SkljfryW. ^lkbp©DNA»f>t^^VI^M^e]^aiL^o oi^ tac^n^-^-. 
^ ^ t°a^- tf©SDE^ rrnBTlT2^- $ ^-^-^WI^^TOIg^ 
20 ? ^-pMPRA3( Japan Kokai Tokkyo koho, Jp 02, 182186)1 ^g^20^.fe©AatI 1 1 
SmalTrfrffcU/c^ 0. %%Tti~u-Wlimm$kmzfr\ m. 8kbp©DNA8frH-^ 

m®JM109^Kf^U/c o fga^b^X; KpMKGAU-AatII^$aU> 

25 

Two oligonucleotide primers, PR1 (5'-GGGACGTCATGGCCTATGTCCCCGCACC-3')
and PR2 (5'-GGCGACGTCTGAGCCCGGATCCTGCCC-3') ^D&Agi^MT^-f H><-f^-
•>X^AXfch){Ccfc<9'ftM©7 p a h3-;H:m^iU;o KpHP01049 
^lngiT^^-PRK PR2-€-n^'m00pmole^-ffl^T. PCR+-y K^Stfcbd 
30 X 0cDNA©5' ffiiJI8IRfli&£±tiM U/Co 7x7 -;l/}ft|±}. V 20# 
*1jl90bp©DNAItfrK-£^Vl^ ^ Hi UfiBfi Uc Q
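
As a consistency check on primers PR1 and PR2, the sketch below predicts the PCR product they would give on the 5' end of the cDNA. It rests on several assumptions: the template is the first 240 nt of SEQ ID NO: 2, the primer bases damaged in extraction are read as the bases of that template, and `reverse_complement` and `predict_amplicon` are hypothetical helper names rather than anything from the source. It models exact matching only, not real annealing behaviour.

```python
# First 240 nt of the coding sequence (SEQ ID NO: 2), used as the template.
TEMPLATE = (
    "ATGGCCTATGTCCCCGCACCGGGCTACCAGCCCACCTACAACCCGACGCTGCCTTACTAC"
    "CAGCCCATCCCGGGCGGGCTCAACGTGGGAATGTCTGTTTACATCCAAGGAGTGGCCAGC"
    "GAGCACATGAAGCGGTTCTTCGTGAACTTTGTGGTTGGGCAGGATCCGGGCTCAGACGTC"
    "GCCTTCCACTTCAATCCGCGGTTTGACGGCTGGGACAAGGTGGTCTTCAACACGTTGCAG"
)
# Primer sequences with damaged bases restored from the template (assumption).
PR1 = "GGGACGTCATGGCCTATGTCCCCGCACC"
PR2 = "GGCGACGTCTGAGCCCGGATCCTGCCC"
AATII_SITE = "GACGTC"  # recognition sequence of AatII

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


def predict_amplicon(template: str, fwd: str, rev: str, min_anneal: int = 15) -> str:
    """Predict the top strand of the product, allowing 5' primer tails."""
    # Forward primer: find the longest 3'-terminal stretch present in the template.
    for k in range(len(fwd), min_anneal - 1, -1):
        start = template.find(fwd[len(fwd) - k:])
        if start != -1:
            break
    else:
        raise ValueError("forward primer does not match the template")
    # Reverse primer: work with its reverse complement, which reads in the
    # same sense as the template; its 3' end maps to the start of that string.
    rc = reverse_complement(rev)
    for m in range(len(rc), min_anneal - 1, -1):
        site = template.find(rc[:m], start)
        if site != -1:
            break
    else:
        raise ValueError("reverse primer does not match the template")
    return fwd[:len(fwd) - k] + template[start:site + m] + rc[m:]


amplicon = predict_amplicon(TEMPLATE, PR1, PR2)
print(len(amplicon), amplicon.count(AATII_SITE))
```

With these inputs the forward primer contributes a short 5' tail carrying an AatII site directly upstream of the ATG, while the reverse primer lies entirely within the template and spans the AatII site that already occurs in the coding sequence.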

KpMKGAL4-AatI I l#g£. 20J£f£©Aatim^L7^ UTJfu- 

1 0 pMKGAL4/JM109®— m&Wfe 10ml<£-100 ^g/ml T > t"-> U >^mBigii!ilOOml fr 

ii^U ±j^^5fe(lliMUfc7^ h-X@^Yb-tr-7rn-X4B7!7 
15 ^A(^-y H^«4.5ml){I^{j-. h-X*7Ai^7AlMiiiimm 

20 stfe&w-r^ c £fi^£titz 0 

*$mi*tiu??->-4mm&w ; £=i- K-r-st kcdna. c<z>t ^cdna^-3- k 
25 icMtscim^ mm, ^izmimmmmtLT, 
SEQ ID NO: 1
Sequence length: 323
Sequence

Met Ala Tyr Val Pro Ala Pro Gly Tyr Gln Pro Thr Tyr Asn Pro Thr
1               5                   10                  15

Leu Pro Tyr Tyr Gln Pro Ile Pro Gly Gly Leu Asn Val Gly Met Ser
            20                  25                  30

Val Tyr Ile Gln Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val
        35                  40                  45

Asn Phe Val Val Gly Gln Asp Pro Gly Ser Asp Val Ala Phe His Phe
    50                  55                  60

Asn Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gln
65                  70                  75                  80

Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys
                85                  90                  95

Lys Gly Ala Ala Phe Glu Leu Val Phe Ile Val Leu Ala Glu His Tyr
            100                 105                 110

Lys Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu
        115                 120                 125

Pro Leu Gln Met Val Thr His Leu Gln Val Asp Gly Asp Leu Gln Leu
    130                 135                 140

Gln Ser Ile Asn Phe Ile Gly Gly Gln Pro Leu Arg Pro Gln Gly Pro
145                 150                 155                 160

Pro Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gln Gln Leu
                165                 170                 175

Asn Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val
            180                 185                 190

Pro Tyr Phe Gly Arg Leu Gln Gly Gly Leu Thr Ala Arg Arg Thr Ile
        195                 200                 205

Ile Ile Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala Ile Asn
    210                 215                 220

Phe Lys Val Gly Ser Ser Gly Asp Ile Ala Leu His Ile Asn Pro Arg
225                 230                 235                 240

Met Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp
                245                 250                 255

Gly Ser Glu Glu Lys Lys Ile Thr His Asn Pro Phe Gly Pro Gly Gln
            260                 265                 270

Phe Phe Asp Leu Ser Ile Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr
        275                 280                 285

Ala Asn Gly Gln His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe
    290                 295                 300

Gln Arg Val Asp Thr Leu Glu Ile Gln Gly Asp Val Thr Leu Ser Tyr
305                 310                 315                 320

Val Gln Ile
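
For downstream use, the three-letter listing of SEQ ID NO: 1 is usually converted to one-letter code. The sketch below does that and also estimates an average molecular mass. The extracted codes "Gin", "He" and "lie" are read as Gln, Ile and Ile. The residue masses are standard average values and are not taken from this document, and the helper names are hypothetical, so the mass is only a rough figure.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
# Average residue masses in Da (standard values, not from the source).
RESIDUE_MASS = {
    "A": 71.08, "R": 156.19, "N": 114.10, "D": 115.09, "C": 103.14,
    "Q": 128.13, "E": 129.12, "G": 57.05, "H": 137.14, "I": 113.16,
    "L": 113.16, "K": 128.17, "M": 131.19, "F": 147.18, "P": 97.12,
    "S": 87.08, "T": 101.10, "W": 186.21, "Y": 163.18, "V": 99.13,
}
WATER_MASS = 18.02  # one water per chain for the free N- and C-termini

SEQ_ID_1 = """
Met Ala Tyr Val Pro Ala Pro Gly Tyr Gln Pro Thr Tyr Asn Pro Thr
Leu Pro Tyr Tyr Gln Pro Ile Pro Gly Gly Leu Asn Val Gly Met Ser
Val Tyr Ile Gln Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val
Asn Phe Val Val Gly Gln Asp Pro Gly Ser Asp Val Ala Phe His Phe
Asn Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gln
Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys
Lys Gly Ala Ala Phe Glu Leu Val Phe Ile Val Leu Ala Glu His Tyr
Lys Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu
Pro Leu Gln Met Val Thr His Leu Gln Val Asp Gly Asp Leu Gln Leu
Gln Ser Ile Asn Phe Ile Gly Gly Gln Pro Leu Arg Pro Gln Gly Pro
Pro Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gln Gln Leu
Asn Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val
Pro Tyr Phe Gly Arg Leu Gln Gly Gly Leu Thr Ala Arg Arg Thr Ile
Ile Ile Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala Ile Asn
Phe Lys Val Gly Ser Ser Gly Asp Ile Ala Leu His Ile Asn Pro Arg
Met Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp
Gly Ser Glu Glu Lys Lys Ile Thr His Asn Pro Phe Gly Pro Gly Gln
Phe Phe Asp Leu Ser Ile Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr
Ala Asn Gly Gln His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe
Gln Arg Val Asp Thr Leu Glu Ile Gln Gly Asp Val Thr Leu Ser Tyr
Val Gln Ile
"""


def to_one_letter(listing: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[code] for code in listing.split())


def average_mass(sequence: str) -> float:
    """Sum average residue masses and add one water for the chain termini."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


one_letter = to_one_letter(SEQ_ID_1)
mass = average_mass(one_letter)
print(len(one_letter), one_letter[:20], round(mass))
```

The one-letter string can be compared directly with the HS rows of the alignment in Table 1.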

SEQ ID NO: 2
Sequence length: 969
Sequence type: nucleic acid
Molecule type: cDNA to mRNA
Sequence

ATGGCCTATG TCCCCGCACC GGGCTACCAG CCCACCTACA ACCCGACGCT GCCTTACTAC   60
CAGCCCATCC CGGGCGGGCT CAACGTGGGA ATGTCTGTTT ACATCCAAGG AGTGGCCAGC  120
GAGCACATGA AGCGGTTCTT CGTGAACTTT GTGGTTGGGC AGGATCCGGG CTCAGACGTC  180
GCCTTCCACT TCAATCCGCG GTTTGACGGC TGGGACAAGG TGGTCTTCAA CACGTTGCAG  240
GGCGGGAAGT GGGGCAGCGA GGAGAGGAAG AGGAGCATGC CCTTCAAAAA GGGTGCCGCC  300
TTTGAGCTGG TCTTCATAGT CCTGGCTGAG CACTACAAGG TGGTGGTAAA TGGAAATCCC  360
TTCTATGAGT ACGGGCACCG GCTTCCCCTA CAGATGGTCA CCCACCTGCA AGTGGATGGG  420
GATCTGCAAC TTCAATCAAT CAACTTCATC GGAGGCCAGC CCCTCCGGCC CCAGGGACCC  480
CCGATGATGC CACCTTACCC TGGTCCCGGA CATTGCCATC AACAGCTGAA CAGCCTGCCC  540
ACCATGGAAG GACCCCCAAC CTTCAACCCG CCTGTGCCAT ATTTCGGGAG GCTGCAAGGA  600
GGGCTCACAG CTCGAAGAAC CATCATCATC AAGGGCTATG TGCCTCCCAC AGGCAAGAGC  660
TTTGCTATCA ACTTCAAGGT GGGCTCCTCA GGGGACATAG CTCTGCACAT TAATCCCCGC  720
ATGGGCAACG GTACCGTGGT CCGGAACAGC CTTCTGAATG GCTCGTGGGG ATCCGAGGAG  780
AAGAAGATCA CCCACAACCC ATTTGGTCCC GGACAGTTCT TTGATCTGTC CATTCGCTGT  840
GGCTTGGATC GCTTCAAGGT TTACGCCAAT GGCCAGCACC TCTTTGACTT TGCCCATCGC  900
CTCTCGGCCT TCCAGAGGGT GGACACATTG GAAATCCAGG GTGATGTCAC CTTGTCCTAT  960
GTCCAGATC                                                          969
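
The relationship between SEQ ID NO: 2 and SEQ ID NO: 1 can be checked by translating the 969-base coding sequence with the standard genetic code. The sketch below is one way to do so. It assumes the four positions damaged in extraction read as in the codon table of SEQ ID NO: 3 (blocks `ACATCCAAGG`, `CCCACCTGCA`, `CAGCCTGCCC` and `CCCACAACCC`), and `translate` is a hypothetical helper name.

```python
from itertools import product

# SEQ ID NO: 2 with extraction-damaged blocks restored (assumption, see above).
CDNA = (
    "ATGGCCTATGTCCCCGCACCGGGCTACCAGCCCACCTACAACCCGACGCTGCCTTACTAC"
    "CAGCCCATCCCGGGCGGGCTCAACGTGGGAATGTCTGTTTACATCCAAGGAGTGGCCAGC"
    "GAGCACATGAAGCGGTTCTTCGTGAACTTTGTGGTTGGGCAGGATCCGGGCTCAGACGTC"
    "GCCTTCCACTTCAATCCGCGGTTTGACGGCTGGGACAAGGTGGTCTTCAACACGTTGCAG"
    "GGCGGGAAGTGGGGCAGCGAGGAGAGGAAGAGGAGCATGCCCTTCAAAAAGGGTGCCGCC"
    "TTTGAGCTGGTCTTCATAGTCCTGGCTGAGCACTACAAGGTGGTGGTAAATGGAAATCCC"
    "TTCTATGAGTACGGGCACCGGCTTCCCCTACAGATGGTCACCCACCTGCAAGTGGATGGG"
    "GATCTGCAACTTCAATCAATCAACTTCATCGGAGGCCAGCCCCTCCGGCCCCAGGGACCC"
    "CCGATGATGCCACCTTACCCTGGTCCCGGACATTGCCATCAACAGCTGAACAGCCTGCCC"
    "ACCATGGAAGGACCCCCAACCTTCAACCCGCCTGTGCCATATTTCGGGAGGCTGCAAGGA"
    "GGGCTCACAGCTCGAAGAACCATCATCATCAAGGGCTATGTGCCTCCCACAGGCAAGAGC"
    "TTTGCTATCAACTTCAAGGTGGGCTCCTCAGGGGACATAGCTCTGCACATTAATCCCCGC"
    "ATGGGCAACGGTACCGTGGTCCGGAACAGCCTTCTGAATGGCTCGTGGGGATCCGAGGAG"
    "AAGAAGATCACCCACAACCCATTTGGTCCCGGACAGTTCTTTGATCTGTCCATTCGCTGT"
    "GGCTTGGATCGCTTCAAGGTTTACGCCAATGGCCAGCACCTCTTTGACTTTGCCCATCGC"
    "CTCTCGGCCTTCCAGAGGGTGGACACATTGGAAATCCAGGGTGATGTCACCTTGTCCTAT"
    "GTCCAGATC"
)

# Standard genetic code, codons enumerated in T, C, A, G order per position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


protein = translate(CDNA)
print(len(CDNA), len(protein))
print(protein[:20], protein[-5:])
```

The translation contains no internal stop codon, which is what one would expect if SEQ ID NO: 2 is the coding region alone, without the terminating TAA that follows it in SEQ ID NO: 3.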

SEQ ID NO: 3
Sequence length: 1113
Sequence type: nucleic acid
Strandedness: double
Molecule type: cDNA to mRNA
Organism: Homo sapiens
Clone name: HP01049
Feature key: CDS
Location: 57..1029
Sequence

ATCTCCCACT CCTGCAGCTC TTCTCACAGG ACCAGCCACT AGCGCAGCCT CGAGCG ATG    59
                                                              Met     1

GCC TAT GTC CCC GCA CCG GGC TAC CAG CCC ACC TAC AAC CCG ACG CTG   107
Ala Tyr Val Pro Ala Pro Gly Tyr Gln Pro Thr Tyr Asn Pro Thr Leu    17

CCT TAC TAC CAG CCC ATC CCG GGC GGG CTC AAC GTG GGA ATG TCT GTT   155
Pro Tyr Tyr Gln Pro Ile Pro Gly Gly Leu Asn Val Gly Met Ser Val    33

TAC ATC CAA GGA GTG GCC AGC GAG CAC ATG AAG CGG TTC TTC GTG AAC   203
Tyr Ile Gln Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val Asn    49

TTT GTG GTT GGG CAG GAT CCG GGC TCA GAC GTC GCC TTC CAC TTC AAT   251
Phe Val Val Gly Gln Asp Pro Gly Ser Asp Val Ala Phe His Phe Asn    65

CCG CGG TTT GAC GGC TGG GAC AAG GTG GTC TTC AAC ACG TTG CAG GGC   299
Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gln Gly    81

GGG AAG TGG GGC AGC GAG GAG AGG AAG AGG AGC ATG CCC TTC AAA AAG   347
Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys Lys    97

GGT GCC GCC TTT GAG CTG GTC TTC ATA GTC CTG GCT GAG CAC TAC AAG   395
Gly Ala Ala Phe Glu Leu Val Phe Ile Val Leu Ala Glu His Tyr Lys   113

GTG GTG GTA AAT GGA AAT CCC TTC TAT GAG TAC GGG CAC CGG CTT CCC   443
Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro   129

CTA CAG ATG GTC ACC CAC CTG CAA GTG GAT GGG GAT CTG CAA CTT CAA   491
Leu Gln Met Val Thr His Leu Gln Val Asp Gly Asp Leu Gln Leu Gln   145

TCA ATC AAC TTC ATC GGA GGC CAG CCC CTC CGG CCC CAG GGA CCC CCG   539
Ser Ile Asn Phe Ile Gly Gly Gln Pro Leu Arg Pro Gln Gly Pro Pro   161

ATG ATG CCA CCT TAC CCT GGT CCC GGA CAT TGC CAT CAA CAG CTG AAC   587
Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gln Gln Leu Asn   177

AGC CTG CCC ACC ATG GAA GGA CCC CCA ACC TTC AAC CCG CCT GTG CCA   635
Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val Pro   193

TAT TTC GGG AGG CTG CAA GGA GGG CTC ACA GCT CGA AGA ACC ATC ATC   683
Tyr Phe Gly Arg Leu Gln Gly Gly Leu Thr Ala Arg Arg Thr Ile Ile   209

ATC AAG GGC TAT GTG CCT CCC ACA GGC AAG AGC TTT GCT ATC AAC TTC   731
Ile Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala Ile Asn Phe   225

AAG GTG GGC TCC TCA GGG GAC ATA GCT CTG CAC ATT AAT CCC CGC ATG   779
Lys Val Gly Ser Ser Gly Asp Ile Ala Leu His Ile Asn Pro Arg Met   241

GGC AAC GGT ACC GTG GTC CGG AAC AGC CTT CTG AAT GGC TCG TGG GGA   827
Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp Gly   257

TCC GAG GAG AAG AAG ATC ACC CAC AAC CCA TTT GGT CCC GGA CAG TTC   875
Ser Glu Glu Lys Lys Ile Thr His Asn Pro Phe Gly Pro Gly Gln Phe   273

TTT GAT CTG TCC ATT CGC TGT GGC TTG GAT CGC TTC AAG GTT TAC GCC   923
Phe Asp Leu Ser Ile Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala   289

AAT GGC CAG CAC CTC TTT GAC TTT GCC CAT CGC CTC TCG GCC TTC CAG   971
Asn Gly Gln His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe Gln   305

AGG GTG GAC ACA TTG GAA ATC CAG GGT GAT GTC ACC TTG TCC TAT GTC  1019
Arg Val Asp Thr Leu Glu Ile Gln Gly Asp Val Thr Leu Ser Tyr Val   321

CAG ATC TAATCTATTC CTGGCGCCAT AACTCATGGG AAAACAGAAT TATCC        1070
Gln Ile                                                           323

CCTAGGACTC CTTTCTAAGC CCCTAATAAA ATGTCTGAGG GTG                  1113